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ID , ' CA13_gg)VIN STANDARD; ;-PRT; 1049 AA. 

. AC ' P04258"; ^ 

DT 20-MAR-1987 (Rel . 04, Created) 

DT 20-MAR-1987 (Rel. 04, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE iSotlagen alpha l(III) chain. 

GN rOLSAl. 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913 ; 

RN [1] 

RP SEQUENCE OF 1-242. 

RX MEDLINE= 8 0 02 602 6 ; PubMed= 488906 ; 

RA Fietzek P.P., Allmann H., Rauterberg J., Henkel W., Wachter E., 

RA Kuhn K . ; 

RT "The covalent structure of calf skin type III collagen. I. The amino 

RT acid sequence of the amino terminal region of the alpha 1 (III) chain 

RT (positions 1-222)."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 360:809-820(1979). 

RN [2] 

RP SEQUENCE OF 243-422. 

RX MEDLINE= 80026027 ; PubMed =488^07; 

RA Dewes H., Fietzek P.P., Kuhn K. ; 

RT "The covalent structure of calf skin type III collagen. II. The amino 

RT acid sequence of the cyanogen bromide peptide alpha 1 (III) CBl, 8 , 10, 2 

RT (positions 223-402)."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 360:821-832(1979). 

RN [3] 

RP SEQUENCE OF 423-571. 

RX MEDLINE= 8 002 6Q2 8 ; PubMed= 488 908 ; 

RA Bentz H., Fietzek P.P., Kuhn K. ; 

RT "The covalent structure of calf skin type III collagen. III. The amino 

RT acid sequence of the cyanogen bromide peptide alpha 1(III)CB4 

RT (positions 403-551)."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 360:833-840(1979). 

RN [4] 

RP SEQUENCE OF 572-808. 

RX MEDLINE=80026029; PubMed= 4 88 909 ; 

RA Lang H., Glanville R.W., Fietzek P.P., Kuhn K. ; 

RT "The covalent structure of calf skin type III collagen. IV. The amino 

RT acid sequence of the cyanogen bromide peptide alpha 1 (III) CBS 

RT (positions 552-788)."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 360:841-850(1979). 

RN [5] 

RP SEQUENCE OF 809-947. 

RX MEDLINE= 8 002 603 0 ; PubMed= 4 88910 ; 

RA Dewes H., Fietzek P.P., Kuhn K. ; 

RT "The covalent structure of calf skin type III collagen. V. The amino 

RT acid sequence of the cyanogen bromide peptide alpha 1(III)CB9A 

RT (position 789-927)."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 3 60:851-860(1979). 

RN [6] 

RP SEQUENCE OF 948-1049. 

RX MEDLINE= 8 002 6031 ; PubMed= 488911 ; 

RA Allmann H., Fietzek P.P., Glanville R.W., Kuhn K. ; 

RT "The covalent structure of calf skin type III collagen. VI. The amino 

RT acid sequence of the carboxyterminal cyanogen bromide peptide alpha 

RT 1(III)CB9B (positions 928-1028)."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 360:861-868(1979). 

CC -!- FUNCTION: COLLAGEN TYPE III OCCURS IN MOST SOFT CONNECTIVE TISSUES 
CC ALONG WITH TYPE I COLLAGEN. 

CC -!- SUBONIT: TRIMERS OF IDENTICAL ALPHA l(III) CHAINS. THE CHAINS ARE 
CC LINKED TO EACH OTHER BY INTERCHAIN DISULFIDE BONDS. TRIMERS ARE 
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CC,-- "ALSO CROSS-LINKED VIA HYDROXYLYSINES . 

CC -!- PTM: PROLINES AT THE THIRD POSITION OF THE TRIPEPTIDE REPEATING 
CC UNIT (G-X-Y) ARE HYDROXYLATED IN SOME OR ALL OF THE CHAINS. 

DR FIR; A02 862; CGB07S. 

DR InterPro; IPR000087 ; Collagen. 

DR InterPro; IPR001007 ; VWF_C. 

DR Pfam; PF01391 ; Collagen; 17. 

DR ProDom; PD000007 ; Gpllagen; 3.^ 

DR PROSITE; PS01208 ; VWFC_1 ; PARTIAL. 



KW 


Extracellular matrix; Connective tissue; Repeat; Hydroxylation, 


KW 


Glycoprotein; Collagen. 


NONHELICAL REGION (N-TERMINAL) . 


FT 


DOMAIN 


1 


14 


FT 


DOMAIN 


15 


1040 


TRIPLE-HELICAL REGION. 


FT 


DOMAIN 


1041 


1049 


NONHELICAL REGION (C-TERMINAL) . 


FT 


MOD RES 


95 


95 


HYDROXYLATION. 


FT 


MOD RES 


107 


107 


HYDROXYLATION. 


FT 


MOD RES 


119 


119 


HYDROXYLATION . 


FT 


MOD RES 


938 


938 


HYDROXYLATION . 


FT 


MOD RES 


950 


950 


HYDROXYLATION . 


FT 


CARBOHYD 


107 


107 


O-LINKED (GAL. . . ) . 


FT 


CARBOHYD 


950 


950 


0-LINKED (GAL. . . ) . 


FT 


DISULFID 


1040 


1040 


INTERCHAIN . 


FT 


DISULFID 


1041 


1041 


INTERCHAIN . 



SQ SEQUENCE ,1-049 AA; 93651 MW; 8EEC33D1C66EC9A3 CRC64 ; 

EYEAYDVKSG 'VAGGGIAGYP GPAGPPGPPG PPGTSGHPGA PGAPGYQGPP GEPGQAGPAG 

PPGPPGAIGP SGKDGESGRP GRPGPRGFPG PPGMKGPAGM PGFPGMKGHR GFDGRNGEKG 

EPGAPGLKGE NGVPGEDGAP GPMGPRGAPG ERGRPGLPGA AGARGNDGAR GSDGQPGPPG 

PPGTAGFPGS PGAKGEVGPA GSPGSSGAPG QRGEPGPQGH AGAPGPPGPP GSDGSPGGKG 

EMGPAGIPGA PGLIGARGPP GPPGTNGVPG QRGAAGEPGK NGAKGDPGPR GERGEAGSPG 

lAGPKGEDGK DGSPGEPGAN GLPGAAGERG VPGFRGPAGA NGLPGEKGPP GDRGGPGPAG 

PRGVAGEPGR NGLPGGPGLR GIPGSPGGPG SNGKPGPPGS QGETGRPGPP GSPGPRGQPG 

VMGFPGPKGN DGAPGKNGER GGPGGPGPQG PAGKNGETGP QGPPGPTGPS GDKGDTGPPG 

PQGLQGLPGT SGPPGENGKP GEPGPKGEAG APGIPGGKGD SGAPGERGPP GAGGPPGPRG 

GAGPPGPEGG KGAAGPPGPP GSAGTPGLQG MPGERGGPGG PGPKGDKGEP GSSGVDGAPG 

KDGPRGPTGP IGPPGPAGQP GDKGESGAPG VPGIAGPRGG PGERGEQGPP GPAGFPGAPG 

QNGEPGAKGE RGAPGEKGEG GPPGAAGPAG GSGPAGPPGP QGVKGERGSP GGPGAAGFPG 

GRGPPGPPGS NGNPGPPGSS GAPGKDGPPG PPGSNGAPGS PGISGPKGDS GPPGERGAPG 

PQGPPGAPGP LGIAGLTGAR GLAGPPGMPG ARGSPGPQGI KGENGKPGPS GQNGERGPPG 

PQGLPGLAGT AGEPGRDGNP GSDGLPGRDG APGAKGDRGE NGSPGAPGAP GHPGPPGPVG 

PAGKSGDRGE TGPAGPSGAP GPAGSRGPPG PQGPRGDKGE TGERGAMGIK GHRGFPGNPG 

APGSPGPAGH QGAVGSPGPA GPRGPVGPSG PPGKDGASGH PGPIGPPGPR GNRGERGSEG 

SPGHPGQPGP PGPPGAPGPC CGAGGVAAI 

// 
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ID ■4CA13_HUMAN STANDARD; PRT^f 1466 AA. 

AC P0246'l; Q15112; ' 

DT 21- JUL- 1986 (Rel. 01, Created) 

DT 01- JAN- 1990 (Rel. 13, Last sequence update) 

DT 15-JUN-2002 (Rel. 41, Last annotation update) 

DE Gjollagen alpha l(III) chain precursor. 

GN C0L3A1 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

OX NOB I_TaxID= 9506; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=SKIN FIBROBLAST; 

RX MEDLINE =89250838; PubMed= 2 7 64 8 86 ; 

RA Ala-Kokko L., Kontusaari S., Baldwin C.T., Kuivaniemi H . , 

RA Prockop D. J. ; 

RT "Structure of cDNA clones coding for the entire prepro alpha 1 (III) 

RT chain of human type III procollagen. Differences in protein structure 

RT from type I procollagen and conservation of codon preferences."; 

RL Biochem. J. 260:509-516(1989). 

RN [2] 

RP SEQUENCE OF 149-1225 FROM N.A. 

RX MEDLINE= 89386015 ; PubMed= 27 803 04 ; 

RA Janeczko R.A. , Ramirez F.; 

RT "Nucleotide and amino acid sequences of the entire human alpha 1 (III) 

RT collagen."; 

RL Nucleic Acids Res. 17:6742-6742(1989). 

RN [3] 

RP SEQUENCE OF 168-398. 

RX MEDLINE= 77134724; PubMed = 557335 ; 

RA Seyer J.M., Rang A.H.; 

RT "Covalent structure of collagen: amino acid sequence of cyanogen 

RT bromide peptides from the amino-terminal segment of type III collagen 

RT of human liver."; 

RL Biochemistry 16:1158-1164(1977). 

RN [4] 

RP REVISIONS. 

RA Seyer J . M . ; 

RL Submitted (DEC-1977) to the PIR data bank. 

RN [5] 

RP SEQUENCE OF 399-727. 

RX MEDLINE =79000343; PubMed= 6 8 75 91 ; 

RA Seyer J.M., Kang A.H.; 

RT "Covalent structure of collagen: amino acid sequence of five 

RT consecutive CNBr peptides from type III collagen of human liver."; 

RL Biochemistry 17:34 04-3411(1978). 

RN [6] 

RP SEQUENCE OF 728-964. 

RX MEDLINE= 801982 82 ; PubMed=6246925 ; 

RA Seyer J.M., Mainardi C, Kang A.H.; 

RT "Covalent structure of collagen: amino acid sequence of alpha 1 (III)- 

RT CBS from type III collagen of human liver."; 

RL Biochemistry 19:1583-1589(1980). 

RN [7] 

RP SEQUENCE OF 950-1466 FROM N.A. 

RX MEDLINE= 88189827; PubMed= 3357782 ; 

RA Mankoo B.S., Dalgleish R.; 

RT "Human pro alpha l(III) collagen: cDNA sequence for the 3' end."; 

RL Nucleic Acids Res. 16:2337-2337(1988). 

RN [8] 

RP REVISION TO 1184. 

RX MEDLINE= 89098346 ; PubMed= 3 2 1 17 6 0 ; 

RA Molyneux K. , Dalgleish R.; 

RT "Human type III collagen 'variant' is a cDNA cloning artefact."; 
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VARIANT 
VARIANT 
VARIANT 
VARIANT 
VARIANT 



VARIANT 
VARIANT 



CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 



CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

SEQUENCE 

MMSFVQKGSW 

LCDDIICDDQ 

GDPGIPGQPG 

PPGPPGTSGH 

ERGLPGPPGI 



163 
168 
226 
241 
278 
293 
472 

614 
635 
664 
676 
896 



1097 
1153 
1156 
1156 



1184 
1203 
1210 
1241 
1357 
1466 AA 



NGA 
E -> 
PGF 



163 
168 
228 
241 
278 
295 
472 
490 

614 T ■ 

635 P ■ 

664 D • 

676 D ■ 

896 S • 

980 S ■ 

98 9 ANC 

1019 D • 

1097 T ■ 

1154 TS 

1156 H ■ 

1156 H ■ 

1184 P • 

1203 A ■ 

1210 G ■ 

1241 V • 

1357 L ■ 
138556 MW; 



G -> V (IN EDS -IV) . 
/FTID=VAR_01115 7 . 
G -> E (IN EDS -IV) . 
/FTID=VAR_0 01801. 

G -> R (IN GROTTON TYPE ACROGERIA) 
/FTID=VAR_01115 8 . 
G -> V (IN EDS) . 
/FTID=VAR_0 01802 . 
G -> R (IN EDS-IV) . 
/FTID=VAR_011159 . 
G -> E (IN EDS) . 
/FTID=VAR_001803 . 

G -> D (IN EDS-IV; SEVERE VARIANT) 
/FTID=VAR_001804 . 
G -> V (IN EDS-IV) . 
/FTID=VAR_001805 . 

G -> E (IN EDS-IV; SEVERE VARIANT) 
/FTID=VAR__001806 . 
G -> R (IN EDS-IV) . 
/FTID=VAR_001807 . 
G -> GG (IN REF. 2) . 
G -> V (IN REF. 4) . 
MISSING (IN REF. 4) . 



(IN REF. 2) . 

(IN REF. 4) . 
DGS (IN REF. 

(IN REF. 2) . 
LGS (IN REF. 



> Y (IN REF. 

> R (IN REF. 

> E (IN REF. 

> N (IN REF. 

> A (IN REF. 6) . 

> A (IN REF. 10) . 

US -> PSGQN (IN REF. 

> Y (IN REF. 7) . 

> P (IN REF. 10) . . 
-> AT (IN REF. 10) . 

> P (IN REF. 9) . 

> S (IN REF. 

> S (IN REF. 

> P (IN REF. 

> A (IN REF. 

> A (IN REF. 

> P (IN REF. 
990E3ACB33A1 



2) . 
2) . 
2) . 
5) . 



10) . 
2) . 
2) . 
2) . 

7 AND 9) . 

11) . 

0847 CRC64; 



SNGAPGQRGE 
ANGAPGLRGG 
AAGERGAPGF 
SPGGPGSDGK 
GPGPQGPPGK 
PKGDAGAPGA 
TPGLQGMPGE 
EGGAPGLPGI 
VAGPPGGSGP 
KDGPPGPAGN 
PPGMPGPRGS 
LPGRDGSPGG 
SRGAPGPQGP 



LLLALLHPTI ILAQQEAVEG GCSHLGQSYA DRDVWKPEPC QICVCDSGSV 
ELDCPNPEIP FGECCAVCPQ PPTAPTRPPN GQGPQGPKGD PGPPGIPGRN 
SPGSPGPPGI CESCPTGPQN YSPQYDSYDV KSGVAVGGLA GYPGPAGPPG 
PGSPGSPGYQ GPPGEPGQAG PSGPPGPPGA IGPSGPAGKD GESGRPGRPG 
KGPAGIPGFP GMKGHRGFDG RNGEKGETGA PGLKGENGLP GENGAPGPMG 
PGLPGAAGAR GNDGARGSDG QPGPPGPPGT AGFPGSPGAK GEVGPAGSPG 
PGPQGHAGAQ GPPGPPGING SPGGKGEMGP AGIPGAPGLM GARGPPGPAG 
AGEPGKNGAK GEPGPRGERG EAGIPGVPGA KGEDGKDGSP GEPGANGLPG 
RGPAGPNGIP GEKGPAGERG APGPAGPRGA AGEPGRDGVP GGPGMRGMPG 
PGPPGSQGES GRPGPPGPSG PRGQPGVMGF PGPKGNDGAP GKNGERGGPG 
NGETGPQGPP GPTGPGGDKG DTGPPGPQGL QGLPGTGGPP GENGKPGEPG 
PGGKGDAGAP GERGPPGLAG APGLRGGAGP PGPEGGKGAA GPPGPPGAAG 
RGGLGSPGPK GDKGEPGGPG ADGVPGKDGP RGPTGPIGPP GPAGQPGDKG 
AGPRGSPGER GETGPPGPAG FPGAPGQNGE PGGKGERGAP GEKGEGGPPG 
AGPPGPQGVK GERGSPGGPG AAGFPGARGL PGPPGSNGNP GPPGPSGSPG 
TGAPGSPGVS GPKGDAGQPG EKGSPGAQGP PGAPGPLGIA GITGARGLAG 
PGPQGVKGES GKPGANGLSG ERGPPGPQGL PGLAGTAGEP GRDGNPGSDG 
KGDRGENGSP GAPGAPGHPG PPGPVGPAGK SGDRGESGPA GPAGAPGPAG 
RGDKGETGER GAAGIKGHRG FPGNPGAPGS PGPAGQQGAI GSPGPAGPRG 
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PVGPSGPPGK DGTSGHPGPI 
GAAAIAGIGG EKAGGFAPYY 
NCRDLKFCHP ELKSGEYWVD 
AEKKHVWFGE SMDGGFQFSY 
QASGNVKKAL KLMGSNEGEF 
IVDIAPYDIG GPDQEFGVDV 



GPPGPRGNRG ERGSEGSPGH 
GDEPMDFKIN TDEIMTSLKS 
PNQGCKLDAI KVFCNMETGE 
GNPELPEDVL DVQLAFLRLL 
KAEGNSKFTY TVLEDGCTKH 
GPVCFL 



PGQPGPPGPP GAPGPCCGGV 

VNGQIESLIS PDGSRKNPAR 

TCISANPLNV PRKHWWTDSS 

SSRASQNITY HCKNSIAYMD 

TGEWSKTVFE YRTRKAVRLP 
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